Illegal forest encroachment represents a critical challenge in the Bamori Range, Guna District, Madhya Pradesh. The conversion of forested land into agricultural areas disrupts biodiversity and threatens ecological stability. This study presents a predictive model, utilizing machine learning, to identify areas prone to illegal land-use changes. By employing spatial and temporal datasets within a Random Forest framework, the model achieved a high accuracy of 94% and an ROC AUC score of 0.9796. Key predictors, such as the \"Rate of Spread\" and \"Neighbor Crop Count,\" emerged as significant drivers of encroachment, providing actionable insights for forest management. This approach offers a data-driven solution to enhance conservation strategies and mitigate environmental degradation.
Introduction
I. INTRODUCTION
A. Background and Context
Illegal conversion of forest lands into agricultural fields is a persistent and complex issue, particularly in this region. This encroachment disrupts ecological balance, and undermines conservation efforts. While remote sensing and geospatial analysis have been used extensively for monitoring land-use changes, their potential for proactive risk prediction remains underutilized. Incorporating predictive modeling can provide forest managers with the necessary foresight to address encroachment effectively.
B. Objective of the Study
This study aims to create a data-driven model to forecast illegal land-use changes in future years. By using a Random Forest algorithm that integrates historical patterns and spatial data, this model provides forest management with a powerful tool to combat encroachment effectively.
C. Significance
Accurate predictions can direct resources where they are most needed. Identifying areas prone to illegal activities can help forest managers implement targeted interventions, raise community awareness, and strengthen preventive measures.
II. LITERATURE REVIEW
A. Prior Studies
Previous research in India has often utilized remote sensing and GIS tools to track land-use changes. Machine learning models like Random Forests have shown potential for handling complex datasets. However, integrating temporal indicators and spatial context remains underexplored in predictive models for illegal encroachment
B. Addressing Gaps
This study bridges the gap by combining spatial and temporal features, improving predictive accuracy and providing practical tools for forest management.
III. DATA AND STUDY AREA
A. Study Area
The Bamori Range, located in Guna District, Madhya Pradesh, is characterized by rich forest cover. However, ongoing encroachment for agricultural purposes poses a significant threat to this region.
B. Data Sources
The land cover classification rasters used in this study were generated using a hyperparameter-tuned Random Forest model described in [Land Cover Classification in Bamori Range using Machine Learning Algorithms: A Comparative Study]. This model utilized Sentinel-2 imagery, Dynamic World probabilities, and spectral indices to classify land cover into five distinct classes. Each raster represents the land cover classes for the respective year, with consistent classification schemes across years, including classes for 'Forest', 'Crop', 'Crop fallow', 'Water bodies', and 'Built Up areas'.
IV. METHODOLOGY
A. Model Selection
A Random Forest classifier was chosen for its robustness, ability to handle complex datasets, and effectiveness in modeling non-linear relationships. Its reliability and resistance to overfitting made it ideal for predicting illegal forest encroachment.
B. Feature Engineering
1) Temporal Features
LULC Classes (2019–2022): Historical land cover data for each year to track patterns.
Change Indicators: Binary variables marking land use changes between consecutive years (2019–2022).
2) Spatial Features
Neighbor Crop Count: Tracks the number of surrounding pixels classified as agricultural.
Directional Changes: Summarizes the spread of encroachment in neighboring pixels.
Rate of Spread: Counts how often a pixel's land class changed between 2019 and 2023
C. Spatial Feature Calculation
Neighbor Identification: The eight surrounding pixels for each forested pixel were identified to capture influences from nearby areas.
Neighbor Crop Count: Calculated by analyzing land cover of neighboring pixels to assess agricultural pressure.
Directional Changes: Tracks encroachment trends in nearby pixels.
D. Model Training and Validation
Data Splitting: A 70: 30 train-test split ensured balanced class representation.
Threshold Adjustment: A 0.35 threshold maximized recall, ensuring most encroachment cases are flagged.
V. RESULTS
A. Model Performance
The Random Forest classifier achieved the following performance metrics on the test set:
Accuracy: 94%
Precision (Encroachment): 89%
Recall (Encroachment): 100%
F1 Score (Encroachment): 94%
ROC AUC Score: 0.9796
The model's ROC curve (in the appendix) demonstrates excellent discriminative ability.
Confusion Matrix
Predicted Not Encroachment
Predicted Encroachment
Actual No Encroachment
581,004
77,869
Actual Encroachment
1,790
6 657,083
B. Feature Importance
The analysis of feature importances revealed the following:
Rate of Spread: 44.21%
LULC_2022: 18.40%
Neighbor Crop Count: 15.91%
LULC_2020: 4.17%
LULC_2021: 3.76%
VI. DISCUSSION
A. Interpretation of Results
The model demonstrates strong predictive capabilities, identifying areas at risk of illegal encroachment by combining historical land cover changes and spatial context. Its high recall ensures that most potential encroachment cases are flagged, allowing forest managers to focus resources effectively.
B. Limitations
Predictions rely on LULC data quality and may not account for unexpected factors like policy changes or natural disasters.
C. Future Work
Enhancements include integrating socioeconomic data, and dynamically updating the model with new data.
Conclusion
This study demonstrates the power of machine learning in addressing environmental challenges. By predicting illegal encroachment in the Bamori Range, the Random Forest model empowers forest managers with actionable insights, contributing to long-term conservation efforts.
References
[1] Pal, M. (2005). Random forest classifier for remote sensing classification. International Journal of Remote Sensing, 26(1), 217222.
[2] Rawat, J. S., & Kumar, M. (2015). Monitoring land use/cover change using remote sensing and GIS techniques: A case study of Hawalbagh block, district Almora, Uttarakhand, India. The Egyptian Journal of Remote Sensing and Space Science, 18(1), 7784.
[3] Joshi, P. K., Roy, P. S., Singh, S., Agrawal, S., & Yadav, D. (2016). Vegetation cover mapping in India using multitemporal IRS Wide Field Sensor (WiFS) data. Remote Sensing of Environment, 103(2), 190202.
[4] Patel, N. R., Kumar, S., & Shekhawat, R. S. (2020). Modeling land use land cover change using geospatial techniques and Markov chain analysis in a semiarid region of India. Environmental Earth Sciences, 79(19), 115.